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ABSTRACT 

This paper describes the development of the 
English-Second-Language Placement Test (EPT) 100-200-300, which 
places adult students into the first three levels of ESL classes, and 
discusses work done on EPT 400-500-600, an experimental test to place 
students in the last three levels of classes* A structured v test, EPT 
100-200-300 tests ability to read items already practiced orally* 
One-page pretests were standardized on students at Alemany Adult 
School in San Francisco* Subsequent work was done under a Federal 
Grant to establish norms and to make a gains study on small city day 
schools and on night schools in urban and migrant worker areas* 

Scores of adult non-academic students were compared with those of 
college students enrolled in English classes for the foreign born at 
San Francisco State College* The new standardized tests are called 
EPT 100-200-300 forms A and B; each contains 50 items, takes a half 
hour, and provides a placement table* EPT 400-500-600 is still 
experimental but students successfully passing this test should be 
able to compete in Adult High Schools or College Foreign Born 
programs* (author/eb) 
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I want to describe my development of EPT 100-200-300, a 

standardized test which places adult second-language otudents 

the first three levels of English-Second Language classes and to 

d" ' A us s work done on EPT 400-500-600, an experimental test X 

developed with Mrs. Jeanette Anderson Best and Mrs. Virginia Biagx 

to place students in the last three levels of English-Second-Language 

classes. EPT (English-Second Language Placement Test) 100-200-300 

has an overall reliability of .96 and form correlation of .93. 

EPT 400-500-600 is still experimental. 

Ten years ago when I first began teaching English as a second 

language in an adult program in San Francisco, I was surprised at the 
utter chaos that existed in our classes. I wondered why we had so 
many students of such diversified abilities and backgrounds in each 
class. It seemed only logical to me as well as to many other teachers 
that we needed a test that would be able to place students better 

than the hit or miss interview in the main office. 

Of course at the beginning, I never realised that we would have 

to develop our own tests. X naturally assumed that we only had to find 
placement tests for English-Second-Language classes and Just administer 
them. We soon discovered, however, that tests for adult students weic 
either directed at first-language learners, to place students in grade 
levels or at college oriented foreign born students . Until our 
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students reached the upper levels In our school, they were unable 
to take the tests. The majority of our students were in the lower 
levels . 

I developed the test and supervised the testing work of Mrs. 

Best and Mrs. Biagi without knowing that classroom teachers usually 
didn’t make standardized tests. This is considered to be the sovereign 
domain of testing services and University professors highly trained in 
psycholinguistics and testing. However, their interests apparently lie 
in other fields that may be lush and green, but that are barely visible 
from classrooms of the non-academic adult English-Second-Language 
learners. We were on the spot and needed a classifying instrument 
which could be administered to large groups of students in a short 
time and which could be graded by clerical personnel. 

I believe that when a student is placed at a level where he can 
realistically handle material and feel that he is succeeding and 
learning, he will stay in a class and will become more proficient in 
English. A teacher can do a better job of structuring lessons, setting 
realistic objectives, and providing a better learning climate when his 
students are more or less at the same level of ability. At the 
beginning levels most of the work is oral and it is difficult to 
provide interesting and challenging work when the students’ range of 
abilities is too great. Until teachers get good sequentially developed 
programmed materials designed to meet the individual language problems 
of each student and each first language background, no machines or 
small grouping will adequately solve the present problem found in 
heterogeneous classes of wide ranges of ability. 

My rationale for making a structure test was that most second- 
language learning materials are built around structural patterns which 
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are sequentially presented. Vocabulary is limited in early levels 
of ESL instruction. Reading and writing are usually confined to 
copying and manipulating oral patterns already mastered. 

I made an objective multiple choice test in a written form 
because it was easier to make, easier to take, and because we needed 
a short test with simple directions that could be administered and 
graded quickly. J. B. Carroll reports that short tests of integrated 
language performance have been found to be just as valid if the items 
are complex and cover a wide sample of linguistic competence as a test 

3 

that has each item contrived to cover one and only one specific feature. - ’ 
I J.B. Carroll, "The Psychology of Language Testing”, Language,. Te s ti ng 
Symp o s ium , A Psycho-linguistic Approach, edited by Alan Da/ies, Oxford 
University Press, 1968, page 56 . 

Carroll also says that we usually test productive and receptive skills 
separately because they are less likely to be highly correlated. I 
did not do this. There has been very little research to date on 
second-language skill testing and correlation of productive and re- 
ceptive skills. In my field there have been no standarized tests 
simple enough to administer that measure these skills independently. 

Like most California non-academic adult students taught by trained 
ESL teachers who use second-language materials, our students can read 

patterns that they have been overdrilled on in class even though no 

2 

formal instruction has been given in reading. 

2 There seems to be a great deal of misunderstanding and confusion about 
a student’s ability to read patterns he has learned orally. Wilga Rivers 
in her book. Teaching Foreign Language Skills , University of Chicago 
Press, 1968, discusses this problem thoroughly on pages 217-220. Charles 

Fries in Teaching and Learning English as a Foreign Langua ge, University 
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of Michigan Press, 1945, referred to the problem in a footnote on 
page 136. See also Roy Steeves and Patricia H. Cabrera, Handbook 
for Teachers of English as a Second Language , Americanization-Literacy , 
Califox da State Department of Education, 1969? pages 18, 21 and 24. 

While I made an objective multiple choice test that required a 
student to look at a printed form of a pattern he had practiced in class, 
and choose the pattern he thought was correct, my distractors (wrong 
1 answers) differed from those in the usual English multiple choice tests 
designed for native born speakers of English. For six years, I had kept 
a card file of errors my students made in patterns. My distractors came 
from those errors made before students learned a correct pattern. For 
this reason my test (although in a written form) tends to test what 
students think is correct from their oral practice and conversation. 

(A statistician in the testing office at San Francisco State College 
remarked that those errors were similar to those deaf children made.) 

When I began teaching ten years ago, our school was a full-time 
adult day school attached to the adult high school program. At that 
time we had five levels of instruction with six hours of classes arranged 

S into periods of 2 hours each. Students ranged in age from 18 - 80 , had 
a wide range of education, and represented about 60 different language 
groups. They usually lived with people who spoke their native or first 
language and may or may not have had some kind of exposure to English. 
Many were not highly motivated, committed students and were not anxious 
and willing to learn English actively. 



Teachers chose their own text books and decided when a student 
was ready for a higher level, which could happen anytime during the 
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semester. No grades were given and students enrolled in class and 
left at will at all points during the class semester. There was much 
shopping around. Some students became attached to one teacher and 

stayed with him a number of terms . If teachers did not maintain a 

/ 

daily attendance of 14, 'their classes were dropped and they often lost 
their jobs. 

Many of the teachers were traditional English High School teachers 
and attempted to teach our second-language learners with high school 
materials for native born students or with children’s readers and 
spellers. Some of the teachers were speech or high school foreign 
language teachers who used traditional materials designed years ago 
for immigrants or second-language learners. A few were trained or 
being trained in the audio-lingual system, and adapted second-language 
materials used in overseas programs or in courses for foreign born 
students enrolled in American colleges . 

At first no one could agree on what was to be taught at which 
level. Many of us held our classes on our personality, showmanship, 
sociability, and interest in helping the students learn English. Some 
of us met and discussed what we could do to realistically help our 
students learn English. Many of us enrolled in linguistic courses. 

William Tresnon, one of our registrars at that time and now the 
principal of our school, felt the same concern we did and asked us to 
form committees to solve two basic problems: 

1. What to teach at each level. 

2* How to place new students in the levels. 

Prom the committee groups, Mr. Tresnon along with James Norris, now 
a registrar at Pacific Heights Adult School in San Francisco, wrote 

a curriculum guide, A Sequential Course of Study in English for the 
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Foreign Born , San Francisco Unified School District, 1967. This 



helped those of us attempting to make a placement test. 

Our first efforts were voluntary. Three of us worked on a 
committee to develop an achievement test which we administered to 
classes at the first three levels. We then regrouped the students 

and the teachers decided which text books would be used at which 
levels. This test had taken a long time to develop and to grade and 
it became compromised. A teacher at another school obtained a copy of 
it and used it as a lesson review. Copies of the test floated through 
our school. Our committee lost interest in developing another test 
until we could have some assurance the test’s security would be 
protected. We also moved to another building without a large room 
where we could administer the test and control the examination. 

Some of us continued to make our own tests so that we could 
change students erroneously placed in our classes as quickly as 
possible. These tests were cumbersome to grade. Once again, we 
tried to administer an achievement type test in our lower level 
classes, but met with resistance from some teachers who refused to 
proctor adequately, and from others who feared some kind of negative 

judgment of their classes. 

In March, 1966, Mr. Dalton Howatt, the adult coordinator of 
the San Francisco Unified School District gave me some released time 
to develop an objective instrument to place students in our classes. 

I was also allowed to administer all tests and maintain necessary 
test security. At first, I hoped to develop listening and writing 
tests as well as grammar structure tests, but finally I limited myself 
to the grammar structure tests. We decided to make pretests (with 
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two forms) for each level. I had a pamphlet written by David Harris 
which became a bible of procedure. 

3 David, P. Harris, English Testing Guidebook, Part II, Fundamentals 
of Test Construction and Interpretation , American University Language 
Center for the International Cooperation Administration, June, 1961 . 

Mr. Robert Breckenridge , one of our teachers who had worked on testing 
in the Human Resources Research Center, U.S.A.P. answered many of my 
questions . 

Some teachers at the school contributed their achievement tests 
and volunteered to grade papers. I analyzed all books used at each 
level and made sample tests from which I obtained student's errors. 

Prom these I made lists of from 200-300 suggested items for an 
objective test that covered patterns taught at each level. Again 
teachers at our school voluntarily analyzed the items and made 
additions or suggestions. 

Then for each level, I made two final sample objective tests 
containing from 50-75 items each, which I gave to students. Next 
by item analysis, I constructed for each level, two one-page pretests 
that could be administered to large groups of students at one time 

1 

and that could be graded in less than a minute per test. Each page 
of legal size mimeograph paper contained 30 three-choice items. 

Students read each item, considered their choices (A.B.C.) 
and put an X on the alphabet letter proceeding the item they thought 
correct. Before beginning the test, students answered trial questions 
to determine if they understood how to take the test. Not until 
everyone could follow the simple instructions did we begin. Students 
then had 15 minutes to do 30 items. Sample trial items are as follows: 
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A , I ±s_ here . 

B , I am here . 

C . I are here . 

2. What is that? 

A. It is a book. 

B. He is a book. 

C. She is a book. 

We were careful to select items that represented various grammar 
problems and that were statistically the most discriminating and 
reliable. By June, 1968, I had developed eight pretests, (two forms 
for each of our first four levels) and had a bank of 300 suggested 
items for our level five. Reliability on the tests were from .74 to 
.84 and form correlations at each level were from .76 to .86. Dr. 
Henry Clay Lindgren from San Francisco State College supervised the 
statistical work on my pretests. (See Attachment D for pretest 
statistics . ) 

By 1968 more of our teachers had had linguistic training; many 
had Masters Degrees in teaching English as a Second Language. We 
were beginning to have a good sound ESL program and had six levels 
of English. We were now a separate Adult language center school with 
an enrollment of about 600 students. Students still attended 
voluntarily. We gave no grades. Teachers chose their own text books 
and decided when a student was ready for a higher level at any tjme 
during the term. Students still entered our classes at any time 
during the term, but our school became so popular that we had waiting 
lists of people wanting to enter our overfilled classes. 

Using the pretests I developed and with the help of teachers 
in the school who helped grade and administer tests, we began to set 
norms for new students by testing students in already existing classes 
of more or less homogeneous abilities. We still had some problems, it 
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was easy to place a new student who spoke no English at all or who 
knew a lot of vocabulary but no structures * If a student spoke some 
English, however, which level test should we give tne student? Where 
should we give the test? Who would grade and assign the students to 
classes? We still had no counselors, no large rooms available, and 
no way of maintaining test security. Federal funds had been cut and 
I no longer had any released time. 

In the meantime, other schools had become interested in our 
program and wanted to use the tests in their schools. 

Mr. Roy Steeves of the California State Department of Education 
had been interested in our tests and with George Johnson of the San 
Francisco Unified School District obtained a federal grant for the 
district from the Southwestern Cooperative Educational Laboratory in 
Albuquerque, New Mexico to: 1. Make a short standardized placement 

test of two forms which would place students into the first three levels 
of English-Second -Language classes, 2. Set State norms on the new 
standardized test, and 3. Finish 400-500-600 pretests. 

Results of the work done under that grant are as follows: 

1. Standardized Test - from the 180 pretest items, 

Mrs. Jeanette Best and I selected 100 items. We 
gave this new test to 263 students at Alemany Adult 
School in January, 1969* From this test Mr. Richard 
Reyes and his assistant, Mr. Fred Gillette, from San 
Francisco State College made two tests of 50 questions 
each which we now call E'PT 100-200-300 Forms A and B. 

Dr. Roger Cummings from San Jose State College super- 
vised all statistical work. Mr. Reyes report 
(attachment A) provides further statistical informa- 
tion about the test. 
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2 . Norms 



In selecting places to give the test, we wrote letters 
to a number of school districts asking them to volunteer to be part 
of the testing program. Since drop out rates are high in adult 
programs and since students enter and leave all during the term, we 
needed an area with a large number of students in the program. We 
also needed to test groups using English-Seqond-Language learning 
materials rather than literacy or other first language materials . 

Mrs. Jean Jacobs from Fremont Adult School In Sacramento 
and Dr. George Jensen from the Imperial Valley School District In 
El Centro volunteered for the program. Fremont Adult School has both 
day and night school heterogeneous classes. Students come from many 
language backgrounds. The Imperial Valley has only night school classes 
with almost all of the students speaking Spanish as a first language. 
Professor Alice Benz of the American Language Institute at San Francisco 
State College and Professor Dorothy Danielson from the Foreign-Born 
College English program at San Francisco State College also gave the 
test to their students. (See attachment C for a comparison of mean 
scores of all groups tested.) 

Since school districts refer to their classes in different 
ways we grouped the classes by the textbook used in the class and 
referred to the levels as 100-200-300 to avoid confusion with grade 
levels referred to by first language teaching methods. 

We were not surprised to learn that our test was too easy 
for the college groups, but we did not anticipate that except for 
Calexico one of the schools in the Imperial Valley, all regular level 
100 classes would fall into a range of means from 8.11 - 20.00, that 
all level 200 classes would fall into a range of means from 20.13 to 
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27 . 95 * and that all level 300 classes would fall Into a range from 
29.80 - 37.13. 

We tested in February, the beginning of a term and again in 
May, the end of the term because we hoped to do a gains study. We 
wanted to determine the effects of different school environments on 
language learning ability and to investigate the feasibility of using 
EPT 100-200-300 as a measure of achievement over the period of one 
semester. Such a study can be made on the students at Alemany Adult 
School, but because a large number of students in the migrant worker 
Imperial Valley area had left school before the end of the term to 
follow the lettuce crop in another area and students at Sacramento 
had taken jobs for the summer, the sampling of students in those areas 
taking tests both in February and in May are too small for a reliable 
study. A pilot gains study revealed, however, that EPT 100-200-300 can 
be used as an achievement test over a period of a semester. Significant 
gains were made by all levels in all geographical areas. (See 
Attachment C . ) 

3. 400-500-600 pretests 

By June, 1969* Mrs. Jeanette Best and Mrs. Virginia 
Biagl at Alemany finished the pretests for levels 
five and six with reliabilities ranging from .54 
to *.87 and form correlations ranging from .77 to 
.85. They also tested other adult schools in our 
area and obtained norms for each level. However, 

SWCEL decided not to renew the grant to put these 
texts into a booklet which would place the students 
in the last three levels of English (400-500 -*■ 600). 

(See Attachmen' D for pretest statistics.) 



Coming full circle, our last efforts have been voluntary in 
making two forms of an experimental test for the last three levels. 

We call this test EPT 400 - 500-600 Forms X & Z. We do not have the 
excellent help of Mr. Reyes to choose the items and we must rely 
again upon schools having upper level classes and people trained 
in testing and handling test security volunteering to give Forms 
X & Z and set norms. Fortunately the American Language Institute, 

San Francisco State College English Program for Foreign Born Students, 
and City College in San Francisco, as well as Cambria Adult School in 
Los Angeles will test their students in February, 1970 and send us 
the results so that we can revise and refine our upper level test. 

Another problem we have on both tests is protecting the security . 
There are some in the adult education field who feel that since part 
of the work was done under a Federal Grant, the test should be 
available to anyone interested in looking at it or using it with no 
arrangements for protecting test security. Some feel it should not 
be restricted in any way and should be placed like a book or article 
on a shelf in a reference library. 

We also have a problem checking the validity of the tests because 
we do not have any outside form of comparison. Teachers in Adult 
Education classes do not give or keep grades for their students. Many 
teachers send all students on to the next level whether they have passed 
or failed the work. On pretests for levels 1 and 2 , we ask«d teachers 
to group students in the top 25$, middle 50 $ and bottom 25$ before the 
students took the tests. A t test showed differences were in the ex- 
pected direction and probability was in every case better than .01. 

We also plan to compare our upper level student scores on current 
standardized tests for first-language learners and on tests designed 
for College Foreign born students. 
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We also have a test administration problem in our school. 

We have found that some teachers are unable to give a standardized 
test and maintain necessary test security without special training. 

We also have no place or personnel available to test new students 
entering our program after the classes are established. At the 
present time, we give the tests at the end of a term to all of our 
students to see if they are ready for the next level. Many new 
students take the tests at this time. A teacher’s judgment obviously 
comes first and a few regular students are sent on to a higher level 
even if scores are not high enough because we feel it is important 
for a student to feel he is learning and successful. 

/ 

s 

A teacher committee is trying to work out a plan wtiere all 
new students would be assigned quickly to a class opr^a temporary 
basis. Then one day every week, one teacher co^d take a double 
class while the released teacher (trained ip/test administration) 
gave the test to the new students. / 

An example of my Pall 1969 clarSs which meets from 10:00 a.m. 
to 12:00 p.m. five days a week reflects a problem we now have after 
a term begins. While my average daily attendance was around 26, 
and my active enrollment averaged around 35 > 79 students passed 
through this class during the semester and 28 remained at the end. 
This was not a loss of 51 students since 21 of these remained in the 
school after being transferred ujd or down one or two times until they 
reached a level where they could work successfully and still be 
challenged. Seven were referred to other schools and colleges, 3 
moved out of the city, 4 found jobs, 6 were ill or had baby sitting 
problems. I don’t know what happened to 10 of the students. Perhaps 
if they had had a better placement at the beginning, they might have 
stayed in the school. 
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One of the major problems is that in addition to a placement 
test, teachers need diagnostic tests that will tell them where their 
students need work. Professor Danielson at San Francisco State 
College is working with a number of Master Degree candidates who 
hope to come up with some diagnostic tests. My placement test will 
not serve as a diagnostic test. It also has no value for placing 
students in literacy classes. The test, however, has been used in 
placing junior high students in bilingual ESL programs. 

Hopefully the work we have done at Alerrany Adult School in San 
Francisco will be helpful to ethers interested in placing students 
at structural levels they can handle, but still find challenging. 
Hopefully someone will develop diagnostic tests in all areas, production 
as well as reception. 

In summary, while we still have many problems in administering 
and using our placement tests, we have one placement test for our 
lower levels that has been standardized and which has a high reliability 
(as well as an experimental form for the upper levels) that place our 
students in classes better than previous methods. We still have to 
move a student or two , but not the large numbers of students we did 
before. The morale in the school is better. In short, both students 
and teachers are happier when placements are made more accurately. 
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TESTING REPORT by RICHARD REYES 

on 

ENGLISH-SECOHD-LANGUAGE PLACE! ©IT TEST for LEVELS 100-200-300 






Purpose : 

The purpose of this study was to develop two equivalent forms of an English 
placement exam design, specifically for non-English speaking subjects , 

The exam will be oriented entirely toward grammar, pl acement . and will utilize 
knowledge of written grammar as the differentiation factor in placement* 

The exam will consist of a maximum of 50 items with a half-hour time limit. 
All questions will be of objective nature with three alternatives. 

The exam is designed to place students in levels 1-3 ou -t a possible six 
levels. A similar exam will be developed to place students in levels 4 - 6. 

Test items will come from a pool of ISO items previously developed and 
validated by the Alemany Adult Education School by Donna Ilyin. 



METHOD (pretest) : 

From past statistical work, a total of 110 questions were selected from the 
original pool of 180 items developed at Alemany School. The basic rules for 
aolootion of items were: 

1) Equal number of items from each of the three levels involved 
(l - 3 levels) • 

2) Item difficulty should be in the range of 50 to SO percent. 

3) Item reliability should be greater than .45. 

4) Items should be equal as far as re p resenting an. area o_f grammar 
ng&e^ary^o„;aa£teri^ 

The 110 items were then split into two forms using the rules governing the 
selection of particular items as much as possible when applicable. 

These two forms were then administered to a total of 263 subjects. 

The purpose of this administration was to determine the equivalence of the 
two forms and to determine the reliability of each form. The 263 subjects 
were equally distributed within levels 1 - 3« 

The results of the test administration were as follows: 

1) Correlation between form A and B was .93 » 

2) Reliability of form A was .92. 

3) Reliability of form B was ,92. 

4) Overall reliability of forms A and B scored together was .96. 

As indicated earlier a maximum of 50 items was desirable due to time 
limitation in administration. ■ To accomplish this, five items from each form 
of the exam were eliminated using the same rules as those used in arriving 
at the original figure of 55. After this was accomplished the two forms of 
the exam were rescored. The results of the rescoring indicated little change 
in the original statistics. The results were as follows: 

1) Correlation between forms - .93* 

2) Reliability of form A - ,91 . 

3) Reliability of form B - .92. 

4) Overall reliability of forms A and B - .95. 








In order to assure that the two forms were equivalent at each of the levels 
a product moment correlation was obtained for each level between forms A and B. 
The results were as follows: 

1) Level 1 - 

2) Level 2 - Rf= f 92 

3) Level 3 - B=.90 

From the results, it was concluded that the two foiras were in fast equivalent 
at each of the desired levels , 

Reliabilities at each of the levels were not computed due to the reduced 
number of students, however, overall reliability of forms A and B were 
sufficiently high enough (.91 and #92) to conclude that the .50 item tests were 
reliable. These reliabilities will be computed on the initial administration 
of each form* 

METHOD (First Examinatio n) s 

In order for this examination to be of value, a norm group was necessary. 

In an attempt to establish these norms for the examination, a total of 243 
students were tested at Alemany School * ^Students wsre . randomly assigned forms 
and given a total of a half hour to complete the examination. 

Results of the administration are shown in the following tables 



w 




MEAL 


SD 


SSH 


EELIABX.LITX 


33 


Level 3 


33.79 


6,09 


2.83 


.73 


44 


Level 2 


27.14 


7.73 


3.05 


.84 


44 


Level 1 


17.66 


7.59 


2.97 


.85 


41 


Level 3 


34.46 


7.03 


2.35 


.84 


44 


Level 2 


26.48 


6.79 


3.09 


.79 


37 


Level 1 


19.03 


7.48 


3.04 


.84 


121 


Form A 


25.5 


9.74 


3.02 


.90 


122 


Fona B 


26.9 


9.40 


3.06 


.89 



Results of this administration .support the results of the pretest data. 

Reliability ranged from a low of ,78 to a high of ,85. Forms A and B produced 
reliabilities of .90 and *89 respectively. The reason for the lower level 
reliability at each of the different levels is due in part to the reduced 
number of students in each group, 

*#Alemanv Adult School has an active enrollment of about 600 foreign students 
per semester from about sixty different countries* In Hay, 1969 the median age 
was 24.2, but the range was from 17 to 80 years, more than half of these students 
had not finished the 12th year in their own countries, but 197 had college 
training and 22 were college graduates. 

About half of the students had little or no English instruction before coming to 
the United States, and many of the others had only a workbook type instruction 
leaving them with little ability for reading comprehension or to speaker under- 
stand the simplest conversations. Most of the students lived with families or 
relatives who probably spoke their native language. 



Means and standard deviations were basically similar at each of the levels. 
Level one shows variation, but this was not significant and is due in part to 
the limited sample in this level* The combining of the levels also failed to 
influence the mean difference between forms A and B. A tost of significance 
indicated no difference between the moans of the two forms* 



From the previous page results, it was concluded that both forms could be 
combined to form one percentile range. This was done mainly for the convenience 
of teachers using the examination. The percentile range is as follows: 

ALL LEVELS COMBINED FOR MS ALEUAilY SCHOOL PREOEKTILES COMBINED 



SCORE. 


FREQUENCY 


CUMULATIVE FREQUENCY 


PREC ENTILE 


50 


0 


243 


99 


49 


2 


243 


99 


40 


1 


241 


99 


47 


1 


240 


99 


46 


0 


239 


98 


45 


2 


239 


98 


A 4 

T T 


2 


237 


98 


43 


1 


235 


97 


42 


1 


234 


96 


41 


4 


233 


95 


40 


5 


229 


94 


39 


6 


224 


92 


33 


2 


218 


90 


37 


11 


216 


89 


36 


7 


205 


84 


35 


6 


198 


81 


34 


10 


192 


79 


33 


9 


132 


75 


32 


Q 

/ 


173 


71 


31 


10 


164 


67 


30 


5 


154 


63 


29 


5 


149 


61 



o 







ALL LEVELS COliBIilED FOULS 


ALEMANI SCHOOL 


PEEC Eli TILES COMBINED 


10HE 


FREQUENCY 


CUMULATIVE FLEQUELTCI 


PREC ENTILE 


28 


8 


144 


59 


27 


4 


136 


56 


26 


11 


132 


54 


25 


10 


121 


50 


24 


12 


in 


46 


23 


7 


99 


41 


22 


9 


92 


33 


21 


9 


33 


34 


20 


7 


74 


30 


19 


11 


67 


28 


18 


11 


56 


23 


17 


8 


45 


19 


16 


6 


37 


15 


15 


7 


31 


13 


U 


5 


24 


10 


13 


4 


19 


8 


12 


0 


15 


6 


11 


2 


15 


6 


10 


1 


13 


5 


9 


2 


12 


5 


8 


2 


10 


4 


7 


1 


8 


3 


6 


1 


7 


3 


5 


2 


6 


3 


4 


1 


4 


2 


3 


1 


3 


1 


2 


0 


2 


1 


1 


2 


2 


1 



1 V 

|eric 






Attachment B 



k> 

BOOKS USED IN E.S.L. 100, 200, AND 300 



E.S.L. 100 



Bernardo, Leo U . & Pantell, Dora F., Englis h Your New Language, 

Book 1, Silver-'Burdett and Company, 19577’ 

Breckenridge, Robert, Book I , publishing pending with McGraw Hill. 

Dixson, Robert J., Modern American English , Book 1, Regents 
Publishing, 1962 . 

Dixson, Robert J., Regents English Workbook , Regents Publishing, 

New York, 1957^ ~ 

English Language Services, English 900 , Book 1 & 2, The MacMillan 
Company, New York, 19647 

English Language Services, English This Nay , Books 1 and 2, The 
Macmillan Company, New York, 1963 • ’ 

English Language Services, Intensive Course in English , Elementary, 
Part 1, English Language Services, 1963 . 

King, Harold V. and Campbell, Russel N., Modern English Primer , 

Part I, English Language Services, Inc., 1956. 

National Council Teachers of English, English for Today , Book I 
McGraw Hill, 1962. 



E.S.L. 200 



Dixson, Robert J., Modern American English, Book 2, Regents 
Publishing, 19727 

English Language Services, English 900 , Books 2 & 3, 

The MacMillan Company, 1964. 

English Language Services, E nglish This Way , Books 3 & 4, 

The MacMillan Company , 19 63 . 

English Language Services, Intensive Course in English , 

Elementary, Part 2, English Language Service, Inc., 1963* 

Hall, J., Practical Conversation in English Intermediate , 

Regents' Publishing Company, I 967 . ' — 

National Council of Teachers of English, English for Today , 

Book 2, McGraw Hill, 1962 . 

Wright, Audrey L. and McGillivray, James H., Let*s Learn English , 
American Book Company, i 960 . 

Fries, American English Series , Book 3, D.C. Heath and Co., 
Boston, 1953. 



E.S.L. 300 



Bernardo, Leo U. & Pantell, Dora F., English Your New Language , 
Book 2, Silver-Burdgett and Company, 196? • 

Binner, Yinal 0., American Folktales I , Thomas Crowell Co., 1968 . 

Breckenridge, Robert, Breckenridge Book II , publishing pending 
with McGraw Hill. 

Dixson, Robert J., Complete Course in English , Book I, Regents 
Publishing, 1955. 

Dixson, Robert J., Modern American English , Book 3 > Regents 
Publishing, 1962 . 

English Language Services, English 900 , Book 4, The MacMillan 
Company, 1964. 

Franklin, Harry B., Meikle, Herbert G., Strain, Jeris E., 

Vocabulary in Context, University of Michigan Press, 1964. 



o 



Attachment B - p. 2 



Lado, Robert & Fries, Charles C., English Pronunciation & 

English Sentence Patterns, University of Michigan ^ ’ 

Taylor, Grant. Learning America'n English , McGraw Hill Book Company, 

x95 °' Lydia A., Reader's Digest Skill Builder 



Wilcox, 

§ 



Lillian 
3, Part 



A. 
2 > 



; Thomas , 
Reader 1 s 



Digest Services, 

WrightT~Audrey , Barrett, Ralph P., Van Syoc , 
Let T s Learn English, Book 4, American 



Inc . , 1959 • 

W. Bryce, 

Book Co . , 1968 . 
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ATTACHMENT D 



STATISTICS FOR ENGLISH PLACEMENT TEST PRETESTS 



Level 100 Tests 
A. Forms L & M 

Sample: 100 Level I students at the end of the semester from various lan- 

guage backgrounds who attended Alomany Admit School i*n Hay, 1967. 
The term began February 2, 1967. 



Maximum possible raw score 
Mean 

Standard deviation 
Reliability using Kuder, formula 20 
Semi-interquartile range 
Pearson Product - Moment correlation between Forms L & M 



Form L 


Form M 


30 


30 


16.47 


16. hi 


5.96 


6.05 


.85 


.85 


2.33 


2.55 



= .80 



Validity: 



Since no grades are given to students enrolled in Aleman*# Adult 
School no data was available of the type usually used in 
validity studies. To make the rating problem as simple as 
possible for the teachers, they were asked to assign students 
to one of three groups - HIGH (consisting of the top 25 per- , 
cent), MID (consisting of the middle 50 percent), and LOW (con- 
sisting of the low 25 percent). In order to control for bias, 
teachers were asked to make these ratings before they had seen 
the test scores for their students. 



Tests of significance between the mean scores for each of these three groups 
were computed by means oft scores. Differences were all in the expected 
direction and probability was in every case better than .01. 



T TEST RESULTS 



FORM L 



FORM M 





Mean 


Sigma 


t 


Mean 


Sigma 


t 


Hi 


22.25 


5»30 




23.55 


3.01 




Mid 


16.33 


3-73 




15.95 


4.20 




Low 

Hi/Mid 


12.76 


5*09 


4.57 


12.52 


4.92 


8.2 


Hi/Low 






8.23 






9.43 


Mid/Low 






3.00 






2.86 



s 



- 1 - 



. 



ERIC 






B. Forms A, B, C, & D 

Teachers analysed a list of £ 0 t suggested test items prepared by the test 
developer. They made suggestions and additions. A trial test . was made 
consisting of 82 items and given to 124 students in the beginning levels. 

A comparison of items missed by the top 27 $ (32 students) with the 27 $ low 
(32 students) yielded a list of discriminating items. 

Two experimental placement tests were made from the 60 best discriminating 
items and called Forms C and D. (Two other Forms A and B were discarded 
as unsatisfactory.) 

In October, 1966, a Pearson Product-Moment correlation between Forms C and 
D on a sample of 86 Level I students was .69. and on a sample of 86 Level 
II students was 

Sample: 57 Level II students at the beginning of the semester from 

various language backgrounds attending Alatpaagr School. in Marqh, 

1967. 



Maximum possible raw score 
Mean 

Standard deviation 
Reliability 

Semi-interquartile range 



Form C 


Form D 


30 


30 


21.34 


19.88 


5 . Cl 


4.95 


.82 


CO 

t>- 

• 


2.12 


2.51 



Validity: Differences in mean scores between Hi, Mid and Low groups were 

mainly in the expected direction. Except for the differences 
in the t test between the Mid/Low for Form D, probability was 
better than .05 in one case and better than .01 in the remain- 
ing. 

T TEST RESULTS 





Mean 


FORM C 
Sigma 


t 


Mean 


FORM D 
Sigma 


t 


Hi 

Mid 

Low 

Hi/Mid 


24.08 

22.00 

18.64 


1.25 

4.95 

4.16 


5.94 

4.19 

2.62 


23.73 

20.26 

17.36 


4.24 

4.24 

5.39 


2.22 

3,08 

1.57 



II Level 200 Tests 
A* Forms N & 0 

Sample : 53 Level II "students at the end of the semester from various .lan- 
guage' backgrounds who attended John Adams Amex in May, 1967* term 

began February 2 , 1967 . 



lERJT 






- 2 - 





Form N 


Form 0 


Maximum possible raw score 


30 


30 


Mean 


18.04 


17.II 


Standard deviation 


4.63 


00 

0- 

• 


Reliability using Kudar formula 20 


•7^ 


.75 


Semi- .Interquartile range 


2.35 


2.38 



Pearsam* P^'daot- Moment correlation between Forms N & 0. » .86 

NOTE: The administration of the test was given under less than standard 

conditions. It was not adequately proctored due to a mi sunder « 
standing on the part of the proctors, however, the results are not 
inconsistent with previous forms. (See the statistics for part IX 
Forms A and B, page 4) 

Validity: Differences in mean scores between Hi, Mid and Low groups were 

mainly in the expected direction. Except for the difference 
in the comparison of Hi means with Mid means, probability was 
better than .01. 



T TEST RESULTS 







FORM N 






FORM 0 






Mean 


Sigma 

■■MT-WUikiA 


t 


Mean 


Sigma 


t 


Hi 

Mid 

Low 

Hi/VfcLd 

Hi/Low 

Mid/Low 


20.23 

13.64 

14.00 


4,78 

3.89 

3.81 


1.00 

3.^7 

3.27 


19.25 

17.36 

13.27 


4.83 

4.86 

3.54 


1.08 

3.38 

2.74 


Forms A, B, C, & D 
Suggested items analyzed by 


teachers = 


189 for 


pretests. 




Trial Placement 


Tests 






Part I 


Part 



Items 


87 


87 


Sample : # Students 


177 


110 


Number of 2J% high scores analyzed 


49 


30 


Number of 2]% low scores analyzed 


49 


30 



Four experimental placement tests were then made each containing 30 three 
choice items and called Forms A, B, C, & D. 

Reliability correlations (r) between Forms A, B, C, & D and the sample 
number of students (N) in October, 1966 were as follows: 



Form A/B 



Level 


II 


Level III 


r = 


.83 


r a .64 


N « 


61 


N = 33 


r m 


.65 




N = 


59 





Form C/D 





Level II 


Level 


III 


Form A/C 


r a 


.62 








N a 


72 






Form A/D 


r = 


.61 








N * 


42 






Form B/C 


r a 


.61 


r a 


.79 




N a 


41 


N a 


14 


Form B/D 


r = 


.58 


r a 


.66 




N a 


44 


N a 


10 



Sample; 50 Level III students at the beginning of the semester from ver- 
±0*3 language backgrounds who attended Alemany School in March, 
196 ?. 





Form A 


Form B 


Maximum possible raw score 


20 


20 


Mean 


20. 10 


20.54 


Standard deviation 


5.06 


5.07 


Reliability 


.81 


.82 


Serai-4.nterquartile range 


2,22 


2.17 



Validity's Due to the practice of grouping students into ability sections 
which are not always consistant throughout the day, at Level 
III, only one teacher* s ranking of students per class was used 
for the t testa. The differences tended in the expected direc- 
tion. Except for the t test for the difference of means 
between the Mid/Low group in Form A and the Mid/Hi group in 
Form B, probability ranged from .05 to better than .01. 

T TEST RESULTS 







FORM A 






FORM B 






Mean 

•Mb*** mm 


Sigma 


t 


Mean 


Signa 


t 


Hi 


24.00 


4,21 




22.71 


5.99 




Mid 


19.39 


5.00 




19.56 


5.12 




Low 

Hi/Mid 


15.33 


4.27 


2.31 


15.22 


2.01 


1.62 


Hi/Low 






4.03 






3.25 


Mid/Low 






1.93 






2.46 



III No standardization work or form correlations were performed on Level 300 
tests, but teachers analyzed 230 questions . 



Trial Placement Test 
Items 

Sample; # Students 
Number 27 % high scores 
Number 27$ low scores 

Placement test Forms A and B were made 

-im- 



part I Part II 



75 


75 


67 


67 


18 


18 


18 


18 



for this level but not standardized. 



o 



1 



s 



IV Level 400 Tests 
A. Forms R & S 



Sample: 51 Level IV students 

terra June, 1969. 


at Alemany tested at the 


end of the 




Form R 


Form S 


Maximum possible raw score 


50 


30 


Mean 


20.06 


19.06 


Standard deviation 


4.53 


4.62 


Reliability 


.74 


•15 


Semi-interquartile range 


2.31 


2.32 


Pearson Product - Moment correlation between R & S .66 


Sample: 23 Level V students 

the term June, 1969. 


at Pacific Heights tested 


at the end of 




Form R 


Form S 


Maximum possible raw score 


30 


30 


Mean 


19.57 


18.00 


Standard deviation 


5.04 


4.86 


Reliability 


.79 


•11 


Semi-interquartile range 


2.29 


2.34 



Pearson Product - Moment correlation bewteen R & S .68 
B. Forms A, B, C, & D 

Trial Placement Tests A, B, C were made from a list of 46 suggested items 
and four variations of four of the items. 

1. Samples 60 Level IV students at the end of the terra. May, 1967. 

Items 50 

Number 27$ high scores analyzed 16 
Number 27 % low scores analyzed 16 

Forms A and B were exactly alike except for a typing variation. Form C 
is a slight variation of B because we did not have enough discriminating 
items. In May, 1967 reliability correlations between Forms B and C on a 
sample of 2£ IV A students was .87 and on a sample of 2^ IV B students 
was .76 . 

2. Sample: 52 Level IV students at the end of the semester from various 

language backgrounds attending Alemany Adult School in May, 1967. 





Form B 


Form C 


Maximum possible raw score 


30 


30 


Mean 


19.76 


19.90 


Standard deviation 


5.14 


5.26 


Reliability 


.82 


.82 



ERIC 



5 



Semi-interquartile range 



Form B 

2.20 



Form C 
2.26 






Validity: Because of ability grouping practices which are not consistent 

throughout the day, only one teacher* s ranking of students per 
class was used for the t tests. The difference of mean scores 
between Hi, Mid and Low were in the expected direction and 
probability was better then .01. 

T. TEST RESULTS 







FORM B 






FORM C 






Mean 


Sigma 


t 


Mean 


Sifima 


t 


Hi 


25.85 


2.30 




25.17 


2.07 




Mid 


19.00 


%.08 




19.^8 


4.84 




Low 


15.17 


if. 31 




16.58 


3.47 




Hi/Mid 






6.57 






5.13 


Hi/Low 






7.56 






7.34 








2.59 






2.12 



Suggested items were taken from lower pretest items too difficult for pre- 
vious levels and used for Form D 

1. Sample: 8o Level IV students at Alemany tested at the end of the term 

May, 1968. 

Trial Placement Test - Spring, 1968 

Items 75 

Number 27$ high scores analyzed 21 

Number 27$ low scores analyzed 21 

Placement Tests C (from 1967) and D (from Spring 1968) 

2. Sample: J 2 Level IV students at Alemany tested at the end of the term 



May, 1968. 


Form C 


Form D 


Maximum possible raw score 


30 


30 


Mean 


19.00 


20.88 


Standard deviation 


4.66 


5.66 


Reliability 


.75 


.85 


Semi-interquartile range 


2.51 


2.22 



Pearson Product - Moment correlation 

between C and D None provided by statistician 



- 6 - 






Level 500 Tests 
A. Placement Tests T & TJ 



term June, 1969. 



Maximum possible raw score 
Mean 

Standard deviation 
Reliability 

Semi-interquartile range 

Pearson Product - Moment correlation 
between T and U 



tested at the 


end of 1 


Form T 


Form TJ 


50 


50 


17,56 


18.59 


5.64 


4.94 


.54 


.78 


2.47 


2.34 



.77 



2 . 



Samples 32 Level VI students at Pacific Heights tested at the end of 
the term June, 1969. 





Form T 


Form TJ 


Maximum possible raw score 


50 


30 


Mean 


20.25 


20.63 


Standard deviation 


5.32 


4.42 


Reliability 


.81 


.75 


Semi-interquartile range 


2.32 


2.26 



Pearson Product-Moment correlation 
between T and TJ 



.82 



B. Placement Tests A & B 

Suggested items were taken from a compilation of about 200 questions. 

1*. Sample: 50 Level VI students at Alemany tested at the beginning of 

the term February, 1969. 

Trial Placement Tests - the beginning of the term February, 1969 



Items 

Number 27# high scores analyzed 
Number 27# low scores analyzed 



75 

14 

14 



2. Sample: 34 Level VI students at Alemany tested in the middle of the 



term March, 1969. 


Form A 


Form B 


Maximum possible raw score 


30 


30 


Mean 


20.85 


2e.89 


Standard deviation 


3.81 


4.15 


Reliability 


.63 


.75 


Semi-interquartile range 


2.32 


2.07 



Pearson Product - Moment correlation 
between A and B 

- 7 - 



None (because statistician did 

not do correlation on the same 
students) 



I 



VI Level 60C? Tests 

A# Suggested items taken from compilation of about 200 questions. 

X. Sample : 70 Level VI students at Alemany tested beginning of term 

Spring, 1969* 

Trial Placement Tests - beginning of Spring term 19^9 



2 . 



Items 

Number Z]fo high scores analyzed 
Number 27$ low scores analyzed 

Sample; 37 Level VI students at Alemany tested at the end of the 
term June, 19^9* 



75 

20 

20 



3. 



Placement Tests A and B 

Maximum possible raw score 
Mean 

Standard deviation 
Reliability 

Semi-interquartile range 

Pearson Product - Moment correlation 
between A and B 



Form A 

50 

21.92 

3.95 

.68 

2.21 



Form B 
30 

21.00 

5.83 

.65 

2.28 



,80 



Sample: 13 Civics students at Pacific Heights tested at the end of 

the term June, 19^9 • 



Maximum possible rawsjore 
Mean 

Standard deviation 
Reliability 

Semi-interquartile range 

Pearson Product - Moment correlation 



Form A 


Form B 


30 


30 


18.23 


19.15 


6.15 


5.25 


.87 


.82 


2.21 


2.2% 



between A and B 



.85 



ERIC Clearinghouse 



FEB 2 



5 h/0 



on Atiuit Education 
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